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POLYNU CLEOTIDE SF.OTrrv rgg 

The present invention relates to recombinant DN.A technologv. and i„ particular to 
nucleotide sequences (and expression products thereof, which are used in the production c: 

transgenic plants. 

The present invention provides. ,mer alia, nucleotide sequences useful in the 
Production of plants which show improved resistance to infection by microorganrs m s such a 

bacteria and fungi. 

According to the present invention there is provided a polynucleotide comprise a 
sequence selected from those depicted in SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. . SEQ 
ID No. 4 and SEQ ID No. 5. 

A particularly preferred embodiment of the polynucleotide consists of nucleotides 
137 to 286 in SEQ ID No. 1, nucleotides 95 to 244 i„ SEQ ID No. 2, nucleotides 108 to ^7 
m SEQ ID No. 3, nucleotides 104 to 253 in SEQ ID No. 4 or nucleotides 1 77 to 326 in SEQ 
ID No. 5. Also included within the invention is the translation product of the region 
comprised by nucleotides 137 to 286 in SEQ ID No. 1, by nucleotides 95 to 244 & in SEQ ID 
No. 2, by nucleotides 108 to 257 in SEQ ID No. 3 or by nucleotides 104 to 253 in SEQ ID 
No. 4 and protein having an amino acid sequence which is at least 95% similar to said 
product. The translation product is an antimicrobial protein. The related antimicrobial 
proteins DmAMP 1 and Dm-AMP2 have been described in Published International Patent 
Application No. WO 93/05153. 

A further preferred embodiment of the polynucleotide consists of nucleotides ?87 to 
38, in SEQ ID NO. 1, nucleotides 245 to 334 in SEQ ID No. 2, nucleotides 258 to 317 in 
SEQ ID No. 3, nucleotides 254 to 343 in SEQ ID No. 4 or nucleotides 327 to 446 in SEQ ID 
No.5. These nucleotides encode protein sequences which may be used as cleavable linkers ir 
the co-expression of multiple proteins as is descnbed further herein. The invention norther 
extends to the translation product of nucleotides 287 to 385 in SEQ ID NO. 1 nucleotides 
245 to 334 in SEQ ID No. 2, nucleotides 258 to 3 1 7 in SEQ ID No. 3. nucleotides 254 to 343 

in SEQ ID No. 4 or nucleotides 327 to 446 in SFO in Vr, ^ , 

-/ ^ -+-40 in itg id Noo and protein havme an amino acid 

sequence which is at least 85% similar to said product. 

A yet norther preferred embodiment of the polynucleotide consists of nucleotides s 3 
o 1 36 ,n SEQ ID No. 1 , nucleotides 1 1 to 94 in SEQ ID No.2, nucleot.des 24 to 1 07 ,n SEQ 



ID No. 3, nucleotides 20 to 103 in SEQ ID No. 4 or nucleotides 1 to 176 in SEQ ID No. 5 
excluding the sequence encoding the intron marked at positions 65 to 156. These nucleotide 
sequences are signal sequences which may be linked to homologous and heterologous 
protein encoding regions to transport proteins extracellularly. The invention further extends 
to the use of said sequences as signal sequences. The invention further extends to the 
translation product of nucleotides 53 to 136 in SEQ ID No. 1, nucleotides 1 1 to 94 in SEQ 
ID No.2, nucleotides 24 to 107 in SEQ ID No. 3, nucleotides 20 to 103 in SEQ ID No. 4 or 
nucleotides 1 to 176 in SEQ ID No. 5 and protein having an amino acid sequence which is at 
least 85% similar to said product. 

It is preferred that the degree of similarity is at least 90%, more preferred that the 
degree of similarity is at least 95% and still more preferred that the degree of similarity is at 
least 97%. 

In the context of the present invention, two amino acid sequences with at least 85% 
similarity to each other have at least 85% similar (identical or conservatively replaced) amino 
acid residues in a like position when aligned optimally allowing for up to 3 gaps, with the 
proviso that in respect of the gaps a total of not more than 15 amino acid residues is affected. 
Likewise, two amino acid sequences with at least 90% similarity to each other have at least 
90% identical or conservatively replaced amino acid residues in a like position when aligned 
optimally allowing for up to 3 gaps with the proviso that in respect of the gaps a total of not 
more than 1 5 amino acid residues is affected. 

For the purpose of the present invention, a conservative amino acid is defined as one 
which does not alter the activity/function of the protein when compared with the unmodified 
protein. In particular, conservative replacements may be made between amino acids within 
the following groups: 

( i ) Alanine, Serine, Glycine and Threonine 

( ii) Glutamic acid and Aspartic acid 

(iii) Arginine and Lysine 

( iv) Isoleucine, Leucine, Valine and Methionine 

(v) Phenylalanine, Tyrosine and Tryptophan 

The invention also includes a polynucleotide encoding a protein having a 
substantially similar activity to any one of those encoded by SEQ ID No. 1 to SEQ ID No. 5. 



which polynucleot.de IS complementary to one which when incubated at a temperature o: 
between 55 and 65'C in 0.3 strength citrate buffered saline containing 0.1% SDS followed 
by rinsing at the same temperature with 0.1 strength citrate buffered saline contain^ 0.1% 
SDS still hybridises with a sequence depicted in SEQ ID No. 1 to SEQ ID No.5. with the 
5 proviso that said sequence is not that described in SEQ ID No. 6 or SEQ ID No. 7. 

The polynucleotide sequence provided in SEQ ID Nos 6 and 7 is the predicted DN A 
sequence for Dm- AMP 1 and Dm-AMP2 as described in Figure 3 1 A of Published 
Internationa! Patent Application No. WO 93/05153. 

The invention still ftirther includes a polynucleotide encoding a protein having a 
3 substantially similar activity to that encoded by nucleotides 137 to 286 in SEQ ID No. 1 bv 
nucleotides 95 to 244 in SEQ ID No. 2, by nucleotides 108 to 257 in SEQ ID No. 3 bv 
nucleotides 104 to 253 in SEQ ID No. 4 or by nucleotides 1 77 to 326 in SEQ ID No.5 which 
polynucleotide is complementary to one which when incubated at a temperature of between 
55 and 65°C in half strength citrate buffered saline containing 0. 1% SDS followed by rinsing 
at the same temperature with 0.3 strength citrate buffered saline containing 0.1% SDS still 
hybridises with the sequence comprised by nucleotides 137 to 286 in SEQ ID No. 1, bv 
nucleotides 95 to 244 in SEQ ID No. 2, by nucleotides 108 to 257 in SEQ ID No. 3 bv 
nucleotides 104 to 253 in SEQ ID No. 4 or by nucleotides 177 to 326 in SEQ ID No.5^th 
the proviso that said sequence is not that described in SEQ ID No. 6 or SEQ ID No. 7. 

When the test and inventive sequences (SEQ ID Nos 1 to 5) are double stranded the 
nucleic acid constituting the test sequence preferably has a TM within 15°C of that of the 
said inventive sequence. In the case that the test and inventive sequences are mixed together 
and are denatured simultaneously, the TM values of the sequences are preferablv within 5=C 
of each other. More preferably the hybridisation is performed under stringent conditions, 
with either the test or inventive sequences preferably being supported. Thus either a 
denatured test or any of the inventive sequences is preferably first bound to a support and 
hybridisation is effected for a specified period of time at a temperature of between 55 and 
65°C in 0.3 strength citrate buffered saline containing 0.1% SDS followed by rinsine of the 
support at the same temperature but with 0.1 strength citrate buffered saline. Where'the 
hybridisation involves a fragment of the inventive sequence, the hybridisation conditions 
may be less stringent, as is indicated in the immediately preceding paragraph. 



It may be desired to target the translation products of the polynucleotide to specific 
sub-cellular compartments within the plant cell, in which case the polynucleotide comprises 
sequences encoding chloroplast transit peptides, cell wall targeting sequences etc. 
immediately 5 1 of the regions encoding the said translation products. 

Translational expression of the protein encoding sequences contained within the 
polynucleotide may be relatively enhanced by including known non translatable translational 
enhancing sequences 5' of the said protein encoding sequences. The skilled man is very 
familiar with such enhancing sequences, which include the TMV-derived sequences known 
as omega, and omega prime, as well as other sequences derivable, inter alia, from the regions 
5' of other viral coat protein encoding sequences. 

In a particularly preferred embodiment of the invention, the polynucleotide is 
modified in that mRNA instability motifs and/or fortuitous splice regions are removed, or 
plant preferred codons are used so that expression of the thus modified polynucleotide in a 
plant yields substantially similar protein having a substantially similar activity/function to 
that obtained by expression of the unmodified polynucleotide in the organism in which the 
protein encoding regions of the unmodified polynucleotide are endogenous, with The proviso 
that if the thus modified polynucleotide comprises plant preferred codons, the degree of 
identity between the modified polynucleotide and a polynucleotide endogenously contained 
within the said plant and encoding substantially the same protein is less than about 60%. 

The invention also includes a plant transformation vector comprising a plant operable 
promoter, a polynucleotide sequence comprising all or part of the sequence selected from 
those depicted in SEQ ID No. 1, SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 4 and SEQ ID 
No. 5 under the transcriptional control thereof and encoding an antimicrobial protein, and a 
plant operable transcription terminator. The promoter may be constitutive or inducible. In 
particular, the promoter may be such that it induces transcription in response to application to 
the plant material containing it of a chemical 

The polynucleotide sequences provided in SEQ ID Nos 1 to 5 are related sequences 
with the translated products thereof showing a high degree of sequence similarity and it is 
believed that they may belong to a multi gene family. 

The invention still further includes plant tissue transformed with the said 
polynucleotide or vector, and material derived from the said transformed plant tissue, as well 



as morphologically normal fertile whole plants comprising the tissue or material. Such 
transformed plants include but are not limited to, field crops, fruits and vegetables such as 
canola, sunflower, tobacco, sugar beet, cotton, maize, wheat, barley, nee. sorghum, tomato, 
mango, peach, apple, pear, strawberry, banana, melon, potato, carrot, lettuce, cabbage, onion 
etc. Particularly preferred genetically modified plants are soya spp, sugar cane, pea, field 
beans, poplar, grape, citrus, alfalfa, rye, oats, turf and forage grasses, flax and oilseed rape, 
and nut producing plants insofar as they are not already specifically mentioned . 

The invention still further includes the progeny of the plants of the preceding 
paragraph, which progeny comprises the polynucleotide of the invention stably incorporated 
into its genome and heritable in a mendelian manner and the seeds of such plants and such 
progeny. 

The invention also provides a method of producing plants which are substantially 
tolerant or substantially resistant to antimicrobial infection, comprising the steps of: 

(.1) transforming plant material with the polynucleotide or vector of the invention; 

( ii) selecting the thus transformed material; and 

(iii) regenerating the thus selected material into morphologically normal fertile 
whole plants. 

Plant transformation, selection and regeneration techniques, which may require 
routine modification in respect of a particular plant species, are well known* to the skilled 
man. 

The invention will be further apparent from the following description taken in 
conjunction with the associated figures and sequence listings in which : 

Figure 1 shows the polynucleotide and corresponding amino acid sequences for A) Dml 
(SEQ ID No5) and B) Dm2.18 (SEQ ID Nol), 

Figure 2 shows the polynucleotide and corresponding amino acid sequences for A ) Dm2.1 
(SEQ ID No2) and B) Dm2.3 (SEQ ID No3) ? 

Figure 3 shows the polynucleotide and corresponding amino acid sequence for Dm2.5 (SEQ 
ID No. 4), 

Figure 4 shows a diagrammatic map of plasmids pMJBl, pDmAMPD and pDmAMPE; 
Figure 5 shows a diagrammatic map of plasmid pFAJ3106; 



Figure 6 shows a diagrammatic map of plasmid pFAJ3109 

Figure 7 shows the nucleotide sequence between the Xhol and Sad sites of pFAJ3 106; 

Figure 8 shows the nucleotide sequence between the Xhol and Sad sites of pFAJ3 109; 

Figure 9 shows a diagrammatic map of plasmid pZPS38; 

Figure 10 shows a diagrammatic map of plasmid pZPS34; 

Figure 1 1 shows a diagrammatic map of plasmid pZPS35; 

Figure 12 shows a diagrammatic map of plasmid pZPS37. 

Figure 13 shows a plan of the construction of the Dm- AMP gene 

Figure 14 shows the polynucleotide sequence for DmAMPl (SEQ ID No. 6) and Dm-AMP2 
(SEQID No. 7). 
EXAMPLE 1 

Dm Gene Isolation and Vector Construction 
Dahlia cDNA library construction 

Near-dry seeds were collected from flowers of Dahlia merkii . 

Total RNA was purified from the seeds using the method of Jepson et al . 

Seeds were frozen in liquid nitrogen and ground to a fine powder using a mortar and pestle. 

Phenol/m-cresol (9: 1) was added followed by RNA homogenisation buffer and the mixture 

ground until a fine paste was obtained. The mixture was spun, the aqueous phase collected 

and extracted twice with phenol/chloroform (1:1). Lithium chloride (12 M) was added to the 

resulting aqueous layer to a final concentration of 2 M and incubated overnight at 4°C. 

Precipitated RNA was collected by spinning at 13,000 rpm in an Eppendorf centrifuge and 

the RNA pellet re-suspended in 5 mM Tris-HCl, pH 7.5. A second overnight lithium chloride 

precipitation was carried out and the RNA collected and re-suspended in 5 mM Tris-HCl, pH 

7.5. 

0.6 mg of total RNA was obtained from 2 g of D. merkii seed. 

PolyATract magnetic beads (Promega) were used to isolate approximately 2 jag poly-A* 
RNA from 0.2 mg of total RNA. 

The poly-A^ RNA was used to construct a cDNA library using a ZAP-cDNA synthesis kit 
(Stratagene). Following first and second strand synthesis double stranded cDNA was size 
fractionated on a Sephacryl S-400 column. The three largest cDNA size fractions were 
pooled and ligated with vector DNA. After phage assembly using Gigapack Gold 
(Stratagene) packaging extracts, approximately 1 x 10 5 pfu were obtained. 



Probing 

Using oiigos AFP-5 (based on n-ter sequence CEKASKTW) and AFP-3EX ( based on C-ter 
sequence MCFCYFNC) and Dahlia genomic DNA. a 144 bp PCR product was produced and 
.solated from an agarose gel. The PCR product was cloned into pBluescnpt. The insert of 10 
transformants were sequenced. The sequences represented 3 Dm- AMP 1 genes one of which. 
PCR clone 4, encoded the observed mature Dm-AMPl. 

The 144 bp PCR product mixture labelled with a 3: -P d-CTP was used to probe Hybond N 
(Amersham) filter lifts made from plates containing a total of 6 x 10' pfu of the cDN'A 
library. The filters were hybridised at 46°C for 18 hrs in 5 x SSPE, 0.01 % SDS. 0.25% 
skimmed milk powder. Filters were washed in 2 x SSC, 0.1% SDS at 60°C. Autoradiography 
was carried out at -70°C with intensifying screens. Thirty potentially positive signals were 
observed. 22 plaques were picked and taken through two further rounds of screening. After 
in vivo excision 13 clones were characterised by DNA sequencing. 
Four classes of Dm- AMP related peptide were encoded by the 13 cDNA clones and the 
sequences of these peptides are provided in SEQ ID Nos 1- 4 in the accompanying figures. 
Three versions of the Dm- AMP core region were represented in the four classes. One of the 
classes (Dm2.5 type) contained a core region which may correspond to Dm-AMP2. 
None of the cDNAs encoded a core region equivalent to the observed mature Dm-.AMPl 
peptide sequence. 

Isolation of a Mature Dm-AMPl Gene 

Using the sequence of PCR clone 4 (above) and information from the NH : and COOH ends 
of the peptides described by cDNA sequences two pairs of oligonucleotides were designed 
for amplification of a gene encoding the observed mature Dm-AMPl . 

DahJia genomic DNA was PCRed with MATAFP-5P and MATAFP-5, the PCR product was 
cloned into pBluescnpt and clones were sequenced. A clone containing the 5' half of a Dm- 
AMPl gene was identified. 

Dahlia genomic DNA was PCRed with MATAFP-3 and DM25MAT-3, the PCR product was 
cloned into pBluescript and clones were sequenced. A clone containing the 3" half of a Dm- 
.AMPl gene was identified. 

The 5' and 3' sections of the mature gene were combined to assemble the sequence of the 
mature Dm-.AMPl gene (see below) . Exon 1, 64 bp encoding part of the leader peptide, 92 



bp intron and exon 2 encoding the end of the leader sequence, Dm- AMP 1 core and C- 

terminal extension. 

Vector Oligonucleotide design 

Four oligonucleotides were designed based on the DNA sequence of the mature Dm- AMP 1 
gene: 

DMVEC-1 top strand priming at the 5' end of the mature DmAMP-1 gene incorporating a 
Nco I site at the translation start of DmAMP-1 allowing cloning into pMJBl (see Figure 4). 
DMVEC-2 bottom strand priming in the 3' end of the C-terminal extension and a Sac I site 
for cloning in pMJB 1 . 

DMVEC-3 top strand priming at the 5' end of the mature DmAMP-1 gene incorporating a 
Nco I site at the translation start of DmAMP-1 allowing cloning into pMJBl also encoding 
complete signal peptide (minus intron). 

DMVEC-4 bottom strand priming in the 3' end of the core region and a Sac I site for cloning 
in pMJBl. 

Dahlia genomic DNA was used as template in a PCR with oligonucleotides DMVEC-1 and 
DMVEC-2. A product of approximately 450 bp was obtained, this PCR product was used as 
a template for PCRs described in vector construction below. 
Vector Construction 
pDmAMPD 

The PCR product obtained with DMVEC-1 and DMVEC-4 was cut with Nco I and Sac I the 
60 bp Nco I/Sac I fragment was isolated and cloned into pMJBlcut with Nco I and Sac I. 
The fragment in one of the resulting transformants was confirmed by sequencing, the clone 
was termed pDmAMPA. 

The PCR product obtained with DMVEC-3 and DMVEC-4 was cut with Nco I, the 150 bp 
Nco I fragment isolated and cloned into pDmAMPA cut with Nco I. The region of DNA 
encoding Dm- AMP leader plus core region in one of the resulting transformants was 
sequenced (see sequence below), the clone was termed pDmAMPD. 
pDmAMPE 

The PCR product obtained with DMVEC-1 and DMVEC-2 was cut with Nco I and Sac I the 
180 bp Nco I/Sac I fragment was isolated and cloned into pMJBlcut with Nco I and Sac I. 



The fragment in one of the resulting transformants was confirmed by sequencing, the clone 
was termed pDmAMPB. 

The PCR product obtained with DMVEC-3 and DMYEC-4 was cut with Xco I. the 150 b P 
Nco I fragment isolated and cloned into pDmAMPB cut with Xco I. The region of DXA 
encoding Dm- AMP leader, core plus C-terminal extension in one of the resulting 
transformants was sequenced (see sequence below), the clone was termed pDmAMPE. 
Both vector sequences contain PCR derived base changes with respect to the Dm- AMP I 
gene however the base changes do not result in amino acid substitutions. 
AFP-5 (to CEKASKTW) 

TG ( T , C ) GANAANGCN ( A , T ) ( G , C ) NAA { A , G ) ACNTGG 
AFP-3EX (to MCFCYFNC) 

CA ( A, G) TT (A, G) AANTANCANAAA (A, G) CACAT 
MAT AFP-5 P 

ATGGC ( C , G ) AAN (A, C) (A, G) NTC (A, G) GTTGCNTT 
MATAFP-5 

AAACACATGTGTTTCCCATT 
MATAFP-3 

AGCGTGTCATGTGCGTAAT 
Dm25MAT-3 

TAAAGAAACCGACCCTTTCACGG 
DMVEC-1 

ATCGTAGCCATGGTGAATCGGTCGGTTGCGTTCTCCGCG 
DMVEC-2 

AAACCGACCGAGCTCACGGATGTTCAACGTTTGGAAC 
DMVEC-3 

ATGCATCCATGGTGAATCGGTCGGTTGCGTTCTCCGCGTTCGTTCTGATCCTTTTCGTGC-3 



:atctcagatatcgca7ccgttagtggagaactatgcgagaaa 



DMVEC-4 
AGCAAGCTTT 
EXAMPLE 2 



Constructions of plant transformation vectors 

Expression cassettes containing a Dm- AMP 1 open reading frame functionally linked to an 
enhanced 35S promoter, a TMV omega translational enhancer and a Nos 3' region are 
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isolated as restriction fragments. pDmAMPD and pDmAMPE are both digested with the 
restriction endonucleases Hindlll and EcoRJ and the appropriate restriction fragment isolated 
and purified. Each fragment is ligated into a binary vector (a pBrN19 derivative named 
pBml9i) which has also been digested with Hindlll and EcoRI. The resulting constructs, 
named pDmAMPLC and pDmAMPLCC, incorporate the expression cassettes from 
pDmAMPD and pDmAMPE respectively. 

pDmAMPLC and pDmAMPLCC are subsequently introduced into Agrobacterium 
tumefaciens strain LBA4404 and introduced into tobacco and oil seed rape using standard 
plant transformation methodology. 

Plants are regenerated from callus tissue resistant to the selective agent kanamycin and 
expression of the Dm- AMP 1 product is monitored by standard Western blot or ELISA 
methods using antibody which had been raised against Dm- AMP 1 protein. A range of 
expression levels are detected. The Dm- AMP 1 expressed in selected transgenic is further 
characterised following extraction and partial purification from leaves of such lines. The 
product is of the predicted mass, as indicted by mass spectrometry. It is also demonstrated to 
retain biological activity after extraction as demonstrated by retention of antifungal activity 
in in-vitro (micro-titre plate) assays. 
EXAMPLE 3 

Constructions of plant transformation vectors for polyprotein expression 

Schematic representations of the plant transformation vectors used in this work, pFAJ3 106 and 
pFAJ3 109, are shown in figures 5 and 6, respectively. The nucleotide sequences comprised 
between the Xhol and Sacl sites of these plasmids, which encompass the regions encoding 
antimicrobial proteins, are presented in Figures 7 and 8. The regions comprised between the 
Xhol and Sacl sites of plasmid pFAJ3 106 (shown in Figure 7) was constructed following the 
two-step recombinant PCR protocol of Pont-Kindom G.A.D. (1994, Biotechniques 16, 1010- 
1011). Pnmers OWB175 (5 ' AGGAAGTTC ATTTC ATTTGG) and OWB279 (5'- 

GCCTTTGGCACAACTTCTGCCTCTTTCCGATGAGTTGTTCGGCTTTAAGTTTGTC); 
were used in a first PCR reaction with plasmid pDMAMPE (see above) as a template. The 
second PCR reaction was done using as a template plasmid pFRG4 (Terras F.R.G. et aL, 1995, 
Plant Cell 7, 573-588) and as primers a mixture of the PCR product of the first PCR reaction, 
primer OWB175 and primer OWB172 (5'TTAGAGCTCCTATTAACAAGGAAAGTAGC, 
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Sad sue underlined). The resulting PCR product was digested with Xhol and Sad and cloned 
into the expression cassette vector p.MJBl (see above). The expression cassette in the 
resulting plasmid. called P FAJ3099. was digested with Hindlll (flanking the 5" end of the 
CaMV35S promoter) and EcoRl .flanking the 3' end of the nopaline synthase terminator) and 
5 cloned in the corresponding sites of the plant transformation vector pGPTVbar (Becker D. et 

al., 1992, Plant Mol. Biol. 20. 1 195-1 197) to yield plasmid pFAJ3 106. 

Plasmid P FAJ3 1 09 was constructed by cloning the Hindm-EcoKL fragment of plasmid 

pDMAMPD (see above) into the corresponding sites of plant transformation vector pGPTVbar 

(see above). 

1 Plant transformation 

Arabtdopsts thaliana ecotype Columbia-0 was transformed using recombinant Agrobacterium 
tumefaciens by the inflorescence infiltration method of Bechtold N. et al. (1993, C.R. Acad. 
Sci. 316, 1194-1199). Transformants were selected on a sand/perlite mixture subimgated with 
water containing the herbicide Basta (Agrevo) at a final concentration of 5 mg/1 for the active 
ingredient phosphinothricin. 
Elisa assays and protein assays 

Antisera were raised in rabbits injected with either RsAf P2 (purified as described in Terras 
F.R.G. etal., 1992, J. Biol. Chem. 267, 15301-15309) or DmAMPl (purified as in Osborn 
R-W. et al., 1995, FEBS Lett. 368, 257-262). ELISA assays were set up as competitive type 
assays essentially as described by PennincLx I.A.M.A. et al. (1996, Plant Cell 8, 2309-2323). 
Coating of the ELISA microtiter plates was done with 50 ng/ml RsAFP2 or DmAMPl in 
coating buffer. Primary antisera were used as 1000- and 2000-fold diluted solutions 
(DmAMPl and RsAFP2, respectively) in 3 % (w/v) gelatin in PBS containing 0.05 % (v/v) 
Tween 20. 

Total protein content was determined according to Bradford (1976, Anal. Biochem. 72, 248- 
254) using bovine serum albumin as a standard. 
Purification and characterisation of expressed proteins 

Arabidopsis leaves were homogenized under liquid nitrogen and extracted with a buffer 
consisting of 1 0 mM NaH,P0 4 , 1 5 mM Na : HP0 4 , 1 00 mM KC1, 1 .5 M NaCl. The 
homogenate was heated for 10 min at 85=C and cooled down on ice. The heat-treated extract 
was centrifuged for 1 5 min at 1 5 000 x g and was injected on a reserved phase high pressure 
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liquid chromatography column (RP-HPLC) consisting of C8 silica (0,46 cm x 25 cm; Rainin) 
equilibrated with 0. 1 % (v/v) trifluoroacetic acid (TFA). The column was eluted at 1 ml/min 
in a linear gradient in 35 mm. from 15 % to 50 % (v/v) acetonitrile in 0.1 % (v/v) TFA. The 
eluate was monitored for absorbance at 214 nm, collected as 1 ml fractions, evaporated and 
5 finally redissolved in water. The fractions were tested by ELISA assays. 
Preparation of extracellular fluid and intracellular extract 

Intercellular fluid was collected from Arabidopsis leaves by immersing the leaves in a beaker 
containing extraction buffer (10 mM NaH 2 P0 4 , 15 mM Na 2 HP0 4 , 100 mM KC1, 1.5 M NaCl). 
The beaker with the leaves was placed in a vacuum chamber and subjected to six consecutive 
1 0 rounds of vacuum for 2 min followed by abrupt release of vacuum. The infiltrated leaves were 

gently placed in a centrifuge tube on a grid separated from the tube bottom. The intercellular 

fluid was collected from the bottom after centrifugation of the tubes for 15 min at 1800 x g. 

The leaves were resubjected to a second round of vacuum infiltration and centrifugation and 

the resulting (extracellular) fluid was combined with that obtained after the first vacuum 
15 infiltration. After this step the leaves were extracted in a Phastprep (BlOlOl/Savant) 

reciprocal shaker and the extract clarified by centrifugation (10 min at 10,000 x g) and the 

resulting supernatant considered as the intracellular extract. 

Characterization of transgenic plants and expression analysis 

To explore the possibility of expressing polyprotein precursor genes in plants, three different 
20 plant transformation vectors were made with the aim to co-express two different cysteine-rich 
plant defensins with antifungal properties, namely RsAFP2 and DmAMPl. The polyprotein 
precursor regions of these constructs all featured a leader peptide region from the DmAMPl 
cDNA, the mature protein domain of DmAMPl, an internal propeptide region, and the mature 
protein domain of RsAFP2. The three constructs differed only in the internal propeptides 
>5 . construct 3 1 06 has a propeptide consisting of a part of the DmAMP 1 propeptide and a 
putative subtilisin-like protease processing site (IGKR) at its C-terminus. 
The rationale behind construct 3 106, is based on our observations that the C-terminal 
propeptides of DmAMPl are cleaved off at their N-terminus when expressed as DmAMP I - 
preproproteins in tobacco, respectively, while this processing event does not detract the mature 
0 proteins from being sorted to the apoplast (De Bolle et al., 1996, Plant Mol. Biol. 31, 993- 
1008; R.W. Osborn and S. Attenborough, personal communication). This infers that the 



processing enzymes are either in the secretory pathway or in the apoplast. On the other hand. 
C-terminal cleavage of the internal propeptide in these constructs should be executed by a 
subtilism-like protease, a member of which in yeast (Kex2) is known to occur in the Golgi 
apparatus (Wilcox C. A. and Fuller R.S., 1991, J. Cell. Biol. 115,297- ). while a member in 
tomato occurs in the apoplast (Tornero P. eial, 1997, J. Biol. Chem. 272, 14412-14419). 
Proteins deposited m the apoplast, the preferred deposition site for antimicrobial proteins 
engineered in transgenic plants (Jongedijk E. eta!., 1995, Euphytica 85, 173-180; De Bolle et 
al., 1 996, Plant Mol. Biol. 3 1 . 993- i 008) are normally synthesized via the secretory pathway, 
encompassing the Golgi apparatus. 

Constructs were also made for expression of DmAMPl (construct 3 109, figure 6). 
Expression levels of DmAMPl and RsAFP2 were analysed in leaves taken from a series of Tl 
transgenic Arabidopsis plants resulting from transformation with the constructs described 
above. The results of the expression analyses based on Elisa assays are presented in Table 1. 
Most of the tested lines transformed with the polyprotein constructs 3 106 clearly expressed 
both DmAMPl and RsAPP2. There was generally a good correlation between DmAMPl and 
RsAFP2 levels. However, the RsAFP2 levels were generally 2 to 5-fold lower than the 
DmAMPl levels. It is not known whether the apparent lower expression levels of RsAFP2 
versus DmAMPl are real or whether they result from a bias in the extraction procedure or the 
assays. The expression levels in the lines transformed with the polyprotein constructs 3 106 
were generally much higher compared to those in lines transformed with the single protein 
construct 3109. Hence, the use of polyprotein constructs appears to result in enhanced 
expression, which is an unexpected finding. 
Analysis of the proteins expressed for polyprotein constructs 
A transgenic line was selected among each of the populations transformed with either 
construct 3 106 and the selected lines were further bred to obtain plants homozygous for the 
transgenes. In order to analyse whether DmAMPl and RsAFP2 were correctly processed in 
these lines, extracts from the plants were prepared as described in Materials and Methods and 
separated by RP-HPLC on a C8-silica column. Fractions were collected and assessed for 
presence of compounds cross-reacting with antibodies raised against either DmAMPl or 
RsAFP2 using Elisa assavs. 
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14 



DmAMPl cross-reacting compound eluted at a position identical or very close to that of 
authentic DmAMPl in the line transformed with construct 3 1 06. Likewise, a RsAFP2 cross- 
reacting compound was detected in the 3106 lines at an elution position identical or very close 
to that of authentic RsAFP2. None of the fractions reacted with both the anti-DmAMPl and 
anti-RsAFP2 antibodies, indicating that an uncleaved fusion protein was not present in the 
extracts. No cross-reacting compounds were observed in a non-transformed line. 
It is concluded that the primary translation products of the transcription units of construct 3 106 
(partial DmAMPl C-terminal propeptide with subtilisin-like protease site as a linker peptide) 
are somehow processed to yield separate DmAMPl -cross-reacting and RsAFP2-cross-reacting 
portions that appear to be identical or very closely related to DmAMPl and RsAFP2, 
respectively, based on their chromatographic behavior. 
Analysis of the subcellular location of coexpressed plant defensins 
In order to determine whether the coexpressed plant defensins are either secreted 
extracellularly or deposited intracellular^, extracellular fluid and intracellular extract fractions 
15 were obtained from leaves of homozygous transgenic Arabidopsis lines transformed with 
constructs 3106. The cytosolic enzyme glucose-6-phosphate dehydrogenase was used as a 
marker to detect contamination of the extracellular fluid fraction with intracellular 
components. As shown in Table 1, glucose-6-phosphate dehydrogenase was partitioned in a 
ratio of about 80/20 between intracellular extract fractions and extracellular fluid fractions. In 
20 contrast, the majority of DmAMPl and RsAFP2 content in all transgenic plants tested was 
found in the extracellular fluid fractions. These results indicate that both plant defensins 
released from the polyprotein precursors are deposited primarily in the apoplast. Hence, all 
processing steps that result in cleavage of the polyprotein structure must occur either in the 
apoplast or along the secretory pathway. 
25 Table 1 : Relative abundance of glucose-6-phosphate dehydrogenase activity (GPD), 

DmAMPl and RsAFP2 in the extracellular fluid (EF) and intracellular extract (IE) fractions 
obtained from transgenic .Arabidopsis plants. 

Construct Relative abundance 1 (%) of 



GPD DmAMPl RsAFP2 



EF IE EF IE EF IE 



pFAJ3106 17 83 94 6 60 40 



Relative abundance is expressed as % of the sum of the contents in the EF and IE fractions. 
EXAMPLE 4 

Expression of the sweet tasting protein Brazzein in tomato 

Production of transgenic tomato plants with increased accumulation of sweet tastin 2 
protein Brazzein. 

Constructs were prepared containing the Dahlia (Dahlia merckii) antimicrobial 
protein signal peptide fused with the Brazzein under the transcriptional control of the 
Arabidopsis polyubiquitin extension protein promoter (UBQ) or the Polygalacturonase 
promoter (PG). Constructs were also prepared which encoded Brazzein without a signal 
peptide but with an N-terminal methionine by the insertion of ATG nucleotides upstream of 
the Brazzein gene under the expressional control of either the UBQ promoter or the PG 
promoter. These were prepared as follows: 

Construction of the transformation vector for expression in tomato with Dahlia signal 
peptide fused to Brazzein under the expressional control of either the UBQ promoter or the 
PG promoter: 

A synthetic DNA was produced which coded for the with the Dahlia signal peptide 
fused to Brazzein. The codons were optimised for expression in tomato. Using appropriate 
restriction sites the coding sequence was cloned into a plasmid vector. The coding region 
was excised from the plasmid and cloned between the promoter in question and the 
terminator in the correct orientation for expression. 

Generation and analysis of plants transformed with the transformation vector. 

The vector was transferred to Agrobacterium tumefaciens LBA4404 (a 
microorganism widely available to plant biotechnologists) and used to transform tomato 
plants. Transformation of tomato stem segments followed standard protocols (e.g. Bird et al 
Plant Molecular Biology 11, 651-662, 1988). Transformed plants were identified by their 
ability to grow on media containing the antibiotic kanamycin. Up to 30 individual plants 
were regenerated with each construct and grown to maturity. The presence of the construct in 
ail of the plants was confirmed by polymerase chain reaction analysis. DNA Southern blot 
analysis on all plants indicated that the insert copy number was between 1 and 10. Northern 
blot analysis on fruit from one plant indicated that the Brazzein gene was expressed. 
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Brazzein production in the fruit of all plants was measured by ELISA (enzyme linked 
imunoabsorption assay) using a polyclonal and a monoclonal antibody raised against native 
Brazzein protein isolated from the fruit of the plant Pentadiplandra brazzeana Baillon Two 
fruit were collected from each transgenic plant at 7 days post breaker (the term breaker is 
used to indicate when the tomato fruit first show signs of the orange colouration 
characteristic of most mature tomato fruit). Total fruit protein was extracted from a sample of 
the pericarp of each of the fruit. The amount of Brazzein protein in the total protein extract 
was measured by ELISA and calculated as the amount of Brazzein per gram fresh weight of 
the fruit. For each plant the average Brazzein content of the two fruits was calculated. In 
some plants Brazzein could not be detected in the fruit using the ELISA technique. Western 
blot analysis of the total protein extract from some of the fruit revealed a 6.5kD protein band, 
which matches the predicted size of the mature Brazzein protein. This confirmed that the 
fruit contained Brazzein and that the signal peptide had been cleaved as if the signal peptide 
had not been cleaved, one would expect the protein to be larger. The Brazzein in fruit from 
plants which had been transformed with a construct lacking a signal peptide was not detected 
by Western blot. This is because the Brazzein content in these fruit is below the level of 
detection by western blot. ELISA is a more sensitive technique than western blot and protein 
was detected in these fruit by this method. 
The results are summarised in Table 2 below. 



TABLE 2 



j Construct 

j 

I Name 


Promoter 


I Signal 

i 

! Peptide 


*>0. 01 

P lo n tc 
X 1*111 

Tested 


Plants 

expressing 

Brazzein 


1 Max 
Brazzein 
ng/g Fresh 
wt 


Min Brazzein 
ng/g Fresh wt 


pZPS34 


UBQ 


None 


29 


18 


25.57 


Not Detected 


p2PS35 


UBQ 


Dahlia AMP 1 


25 


23 


226.53 


Not Detected 


pZPS37 


PG 


None 


15 


7 


12.77 


Not Detected 


pZPS38 


PG 


DahJia AMP1 


13 


11 


51745.77 


Not Detected 









CLAIMS 



A polynucleotide comprising a sequence selected from those depicted in 

SEQ ID No. I, SEQ ID No. 2. SEQ ID No. 3, SEQ ID No. 4 and SEQ ID No. 5. 

A polynucleotide comprising nucleotides 137 to 286 in SEQ ID No. 1, nucleotides 
95 to 244 in SEQ ID No. 2, nucleotides 108 to 257 in SEQ ID No. 3, nucleotides 104 
to 253 in SEQ ID No. 4 or nucleotides 177 to 326 in SEQ ID No. 5. 

A polynucleotide comprising nucleotides 287 to 385 in SEQ ID NO. 1, nucleotides 
245 to 334 in SEQ ID No. 2, nucleotides 258 to 3 1 7 in SEQ ID No. 3, nucleotides 
254 to 343 in SEQ ID No. 4 or nucleotides 327 to 446 in SEQ ID No.5. 

A polynucleotide comprising nucleotides 53 to 136 in SEQ ID No. 1, nucleotides 1 1 
to 94 in SEQ ID No.2, nucleotides 24 to 107 in SEQ ID No. 3, nucleotides 20 to 103 
in SEQ ID No. 4 or nucleotides 1 to 176 in SEQ ID No. 5. 

A polynucleotide according to claim 4 comprising nucleotides 1 to 176 in SEQ ID 
No. 5 excluding the sequence encoding the intron marked at positions 65 to 156. 

A polynucleotide encoding a protein having a substantially similar activity to that 
encoded by any of SEQ ID No. 1, No 2, No. 3, No,4 or No. 5 which polynucleotide 
is complementary to one which when incubated at a temperature of between 55 and 
65°C in 0.3 strength citrate buffered saline containing 0.1% SDS followed by rinsing 
at the same temperature with 0.1 strength citrate buffered saline containing 0.1% SDS 
still hybridises with the sequence depicted in SEQ ID No. 1, or No 2 or No. 3, or No. 
4 or No.5. with the proviso that said sequence is not that in SEQ ID No. 6 or SEQ ID 
No. 7. 



A polynucleotide encoding a protein having a substantially similar activitv to that 
encoded by nucleotides 137 to 286 in SEQ ID No. 1, nucleotides 95 to 244 in SEQ ID 
No. 2, nucleotides 108 to 257 in SEQ ID No. 3, nucleotides 104 to 253 in SEQ ID 
No. 4 or nucleotides 177 to 326 in SEQ ID No. 5., which polynucleotide is 
complementary to one which when incubated at a temperature of between 55 and 
65°C in half strength citrate buffered saline containing 0.1% SDS followed by rinsing 
at the same temperature with 0.3 strength citrate buffered saline containing 0.1% SDS 
still hybridises with the sequence comprised by nucleotides 137 to 286 in SEQ ID 
No. 1, nucleotides 95 to 244 in SEQ ID No. 2, nucleotides 108 to 257 in SEQ ID No. 
3, nucleotides 104 to 253 in SEQ ID No. 4 or nucleotides 177 to 326 in SEQ ID No. 
5. with the proviso that said sequence is not that in SEQ ID No. 6 or SEQ ID No. 7. 

A polynucleotide according to any preceding claim, further comprising a region 
encoding a peptide which is capable of targeting the translation products of the 
sequence to plastids such as chloroplasts, mitochondria, other organelles or plant cell 
walls. 

A polynucleotide according to any preceding claim, wherein translational enhancing 
sequences are inserted 5' of the protein encoding regions comprised by the 
polynucleotide. 

A polynucleotide according to any preceding claim, which is modified in that mRNA 
instability- motifs and/or fortuitous splice regions are removed, or plant preferred 
codons are used so that expression of the thus modified polynucleotide in a plant 
yields substantially similar protein having a substantially similar activity/function to 
that obtained by expression of the unmodified polynucleotide in the organism in 
which the protein encoding regions of the unmodified polynucleotide are 
endogenous, with the proviso that if the thus modified polynucleotide comprises plant 
preferred codons, the degree of identity between the modified polynucleotide and a 
polynucleotide endogenously contained within the said plant and encoding 
substantially the same protein is less than about 60%. 



A plant transformation vector comprising a plant operable promoter, a polynucleotide 
sequence according to claims 1 to 10 under the transcriptional control thereof and a 
plant transcription terminator. 

Plant tissue transformed with the polynucleotide of any one of claims 1 to 10 or the 
vector of claims 1 1 and material derived from the said transformed plant tissue. 

Morphologically normal fertile whole plants comprising the tissue or material of the 
preceding claim. 

The progeny of the plants of the preceding claim, which progeny comprises the 
polynucleotide of any one of claims 1 to 10 stably incorporated into its genome and 
heritable in a mendelian manner, the seeds of such plants and such progeny. 

A method of producing plants which are substantially tolerant or substantially 
resistant to microbial infection, comprising the steps of: 

(i) transforming plant material with the polynucleotide of any one of claims 1 to 
1 0 or the vector of claim 1 1 

(ii) selecting the thus transformed material; and 

(iii) regenerating the thus selected material into morphologically normal fertile 
whole plants. 

Use of the polynucleotide of any one of claims 1 to 10, or the vector of claim 1 1 in 
the production of plant tissues and/or morphologically normal fertile whole plants 
which are substantially tolerant or substantially resistant to microbial infection. 

The translation product of the region comprised by nucleotides 137 to 286 in SEQ ID 
No. L; nucleotides 95 to 244 in SEQ ID No.2, nucleotides 108 to 257 in SEQ ID No. 
3 and nucleotides 104 to 253 in SEQ ID No. 4. and protein having an amino acid 
sequence which is at least 95% similar to said product. 



The translate product of the region comprised by nucleotides 287 to 385 m SEQ ID 
NO. 1. nucleotides 245 to 334 ,n SEQ ID Xo. 2. nucleotides 258 to 317 in SEQ ID 
Xo. 3. nucleotides 254 to 343 in SEQ ID No. 4 or nucleotides 327 to 446 in SEQ ID 
No.5 and protein having an amino acid sequence which is at least 85% similar to said 
product. 

The translation product of the region comprised by nucleotides 53 to 136 m SEQ ID 
No. 1, nucleotides 1 1 to 94 in SEQ ID No.2, nucleotides 24 to 107 in SEQ ID No 3 
nucleotides 20 to 1 03 in SEQ ID No. 4 or nucleotides 1 to 1 76 in SEQ ID No. 5 and' 
protein having an amino add sequence which is at least 85% similar to said product 



A method of selectively controlling microorganisms at a locus compnsin, the plants 
progeny and/or seeds of either of claims 13 or 14, comprising applying to the locus a 
microorganism controlling amount of the translation product of the region comprised 
by nucleotides 137 to 286 in SEQ ID No. 1, nucleotides 95 to 244 in SEQ ID No 9 
nucleotides 108 to 257 in SEQ ID No. 3 and nucleotides 104 to 253 m SEQ ID No. 4. 

Use of the polynucleotide of any one of claims 1 to 10, or the vector of claim 1 1 in 
the production of an antimicrobial protein. 



Use of any one of the polynucleotides of claim 4 or claim 



5 as a signal sequence. 
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